Rock image classification using image processing has been practiced to assist
trained geologists in decision making. However, the study of rock microstructures
and their use in geological investigations poses challenges for image processing
and pattern classification because of the stochastic nature of the mineral
textures revealed at the microscopic level. Locally relevant Igneous Rock
microstructure images were classified into Volcanic and Plutonic Rock subtypes.
Mineral grain size was used as the key physical feature for classification. Three
algorithms, namely a Radial Basis Function (RBF) Support Vector Machine classifier,
an Improved RBF Support Vector Machine classifier, and an AdaBoost algorithm with
the Improved RBF Support Vector Machine as its base classifier, were each used
within a novel ‘Iterative Improved Learning (IIL)’ approach. In the IIL approach,
the images that the chosen algorithm classified accurately were added to the
training set to enhance the ‘breadth and depth’ of the learned knowledge. The
procedure iterated through all available classifiers and compared the
inter-classifier performance and the misclassified images accumulated during
the execution of all algorithms.


1. INTRODUCTION

As interdisciplinary applications, image processing and pattern classification techniques have been 
used for rock type identification. Most of this work is reported on handheld rock specimens [1], [2]. Studies 
of rock microstructures and applications of texture analysis-based approaches have been rare. This is mostly 
due to the stochastic nature of texture formed by minerals. 

In this paper, microstructure image classification of Igneous rocks belonging to the Volcanic and Plutonic
subclasses is reported. Volcanic Igneous rocks are formed by rapid cooling of lava on the Earth’s
surface. The fast cooling ensures that mineral grains do not grow to a large size. Plutonic rocks are formed
below the Earth’s surface by slow cooling of magma. Because the cooling is slow, the mineral grains
tend to grow larger [3].

The basis for classification is the mineral grain size. The textures are analyzed using Haralick
features [4] and a Laws’ Mask-based [5] approach. Support Vector Machines (SVM) are used for
classification. SVM-based classification has been used successfully in many applications, ranging from medical
applications such as MRI brain image classification [6] to archeological applications such as monument
classification [7].

For a Support Vector Machine, proper tuning of its controlling parameters is important. Genetic
Algorithm (GA) based approaches have been used to prevent the SVM from becoming trapped in a local
minimum [8], [9]. Rather than following such complex heuristic search-based approaches, a Radial Basis
Function Support Vector Machine (RBFSVM) classifier is used for classification in the present work. The
controlling parameters C and Sigma are optimized using grid search [10]. In addition, a classifier
combination-based AdaBoost approach is used for classification. A boosting algorithm combines
weak classifiers into a stronger one. It is well suited to small, high-dimensional data sets, is easy to
program, and is highly adaptable [11].
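
As an illustration of the grid search over C and Sigma mentioned above, the following is a minimal sketch and not the implementation used in this work. The grid values, the `score` callback (which would normally return the hold-out accuracy of an RBF SVM trained with the given parameters), and the `toy_score` stand-in are all assumptions introduced for the example; only the Gaussian form of the RBF kernel is standard.

```python
import itertools
import math


def rbf_kernel(x, y, sigma):
    """Gaussian RBF kernel: exp(-||x - y||^2 / (2 * sigma^2))."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-squared_distance / (2.0 * sigma ** 2))


def grid_search(score, c_values, sigma_values):
    """Return (best_score, best_C, best_sigma) over the full C x sigma grid.

    `score` is a hypothetical callback, e.g. the hold-out accuracy of an
    RBF SVM trained with the given C and sigma.
    """
    best = None
    for c, sigma in itertools.product(c_values, sigma_values):
        value = score(c, sigma)
        if best is None or value > best[0]:
            best = (value, c, sigma)
    return best


# Logarithmically spaced candidates (an assumed grid, not the paper's).
C_GRID = [10.0 ** k for k in range(-2, 4)]
SIGMA_GRID = [10.0 ** k for k in range(-2, 3)]


def toy_score(c, sigma):
    """Stand-in objective peaking at C = 10, sigma = 1 (illustration only)."""
    return -((math.log10(c) - 1.0) ** 2) - math.log10(sigma) ** 2


best_score, best_c, best_sigma = grid_search(toy_score, C_GRID, SIGMA_GRID)
```

An exhaustive grid keeps the search simple and reproducible, which is the trade-off against heuristic searches such as GA that the paragraph above describes.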

The percentage accuracy for classification is calculated after each algorithm is implemented on the 
chosen image database. A progressive improvement is seen when an elaborate classification approach such as 
AdaBoost is used, after starting from a conventional RBFSVM approach. When the correctly classified 
images are added to the training image set, the ‘Improved Learning’ results in an improvement 
in % Accuracy. 

The following sections provide the details of the research work. Section 2 describes the
database of images used. Section 3 describes the Haralick features and Laws’ Masks and
presents the process of selecting the feature pair. Section 4 details the concept of Improved Learning (IL).
Section 5 describes the implementation using a flow chart and outlines the execution of the
algorithms. Section 6 presents the results obtained. Section 7 discusses the observations drawn
from the IL-based approach, and Section 8 concludes the paper.


2. DATABASE OF IMAGES USED 

In this study, locally relevant Igneous rock microstructure samples were used. Images belonging to
the Basalt, Andesite, Spherulite, Pseudotachylite, Pegmatite, Dolerite, and Rhyolite subfamilies were chosen.
These varieties are abundant in the Western and Northern parts of the country, in areas such as Goa,
Maharashtra, Gujarat, Rajasthan, the Aravalli Mountains, and Kumaun [12], [13], [14], [15], [16]. The
structural analysis of these subfamilies is vital for construction and irrigation purposes. A total of 128 images,
64 each from the Volcanic and Plutonic categories, with a fair representation of each subfamily mentioned
above, were considered.


3. FEATURE SELECTION 
Haralick features and Laws’ Masks were used to analyze the stochastic textures that arise from
diverse mineral combinations at the microstructure level. The Haralick features considered were

a. Contrast
b. Energy
c. Entropy
d. Homogeneity
e. Correlation

The Laws’ features considered were

a. Absolute Mean
b. Standard Deviation
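
To make the Haralick features listed above concrete, the sketch below computes four of them (Contrast, Energy, Entropy, Homogeneity) from a gray-level co-occurrence matrix in plain Python. It is an illustrative sketch under stated assumptions, not the code used in this study: the function names, the horizontal one-pixel offset, the symmetric normalization, and the two tiny test textures are all hypothetical. Energy is taken here as the sum of squared matrix entries and Homogeneity uses the 1 / (1 + (i - j)^2) weighting; some toolboxes define these slightly differently. Correlation is omitted for brevity.

```python
import math


def glcm(image, levels, dx=1, dy=0):
    """Normalized symmetric gray-level co-occurrence matrix for offset (dx, dy).

    `image` is a list of rows of integer gray levels in range(levels) and
    must contain at least one pixel pair at the chosen offset.
    """
    counts = [[0.0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                counts[i][j] += 1.0
                counts[j][i] += 1.0  # count the pair in both directions
    total = sum(sum(row) for row in counts)
    return [[value / total for value in row] for row in counts]


def haralick_features(p):
    """Contrast, energy, entropy and homogeneity of a normalized GLCM `p`."""
    n = len(p)
    cells = [(i, j, p[i][j]) for i in range(n) for j in range(n)]
    return {
        "contrast": sum((i - j) ** 2 * v for i, j, v in cells),
        "energy": sum(v * v for _, _, v in cells),
        "entropy": -sum(v * math.log2(v) for _, _, v in cells if v > 0),
        "homogeneity": sum(v / (1 + (i - j) ** 2) for i, j, v in cells),
    }


# Two hypothetical binary textures: a fine checkerboard and a coarse one.
fine_texture = [[(r + c) % 2 for c in range(4)] for r in range(4)]
coarse_texture = [[c // 2 for c in range(4)] for _ in range(4)]

fine = haralick_features(glcm(fine_texture, levels=2))
coarse = haralick_features(glcm(coarse_texture, levels=2))
```

In this toy case the fine texture yields the higher Contrast at a one-pixel offset, which is the kind of grain-size sensitivity that makes Contrast a plausible discriminating feature.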

To improve the likelihood of generalization, a small set of features, usually from the original input 
variables, is generated via feature selection. In the feature selection step, redundant or meaningless features 
are discarded to achieve higher generalization performance and faster classification compared to the initial 
set of features. 

The approach starts from an empty set, and features are added continuously while progressively 
checking the classification method’s performance using a suitable classifier. This approach is called the 
forward selection approach [17]. 

A Radial Basis Function Support Vector Machine classifier was used to evaluate candidate feature
sets within the Forward Selection approach. From the seven features, every possible feature pair
(21 in total) was formed, and the classification performance of each pair was evaluated.
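
The selection procedure described above can be sketched as follows. This is a hedged illustration, not the authors' code: `error_of` is a hypothetical callback that would return the Average False Rejection Ratio of a classifier trained on the given feature subset, and `toy_afrr` is a made-up stand-in in which only the value 0.125 for the Contrast and Laws Absolute Mean pair is taken from this paper.

```python
from itertools import combinations

FEATURES = [
    "Contrast", "Energy", "Entropy", "Homogeneity", "Correlation",
    "Laws Absolute Mean", "Laws Standard Deviation",
]


def forward_selection(features, error_of, start_size=2):
    """Pick the best starting subset exhaustively, then add features greedily.

    A feature is added only if it strictly lowers the error; otherwise the
    search stops and the current subset is returned with its error.
    """
    best_set = min(combinations(features, start_size), key=error_of)
    best_error = error_of(best_set)
    remaining = [f for f in features if f not in best_set]
    while remaining:
        candidate = min(remaining, key=lambda f: error_of(best_set + (f,)))
        candidate_error = error_of(best_set + (candidate,))
        if candidate_error >= best_error:
            break  # no improvement: keep the smaller, cheaper subset
        best_set, best_error = best_set + (candidate,), candidate_error
        remaining.remove(candidate)
    return best_set, best_error


def toy_afrr(subset):
    """Stand-in error: 0.125 (from the paper) for subsets containing the
    reported best pair, and an arbitrary 0.30 otherwise."""
    best_pair = {"Contrast", "Laws Absolute Mean"}
    return 0.125 if best_pair <= set(subset) else 0.30


pairs = list(combinations(FEATURES, 2))  # the 21 duplets
selected, error = forward_selection(FEATURES, toy_afrr)
```

Stopping as soon as an added feature fails to lower the error favors the smallest feature set, which keeps the computation time down.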

Thirty-two Volcanic rock images and 32 Plutonic rock images were selected at random using the 
MATLAB function randperm(n) for each trial. One thousand such trials were carried out, and the Average 
False Rejection Ratio (AFRR) was calculated. The AFRR for all 21 duplets is shown in Figure 1.

Figure 1. Average FRR for 21 duplets


The best AFRR of 0.125 was obtained for Contrast and Laws’ Absolute Mean (feature pair 1, 6).
Consequently, triplets were formed with pair (1, 6) as the base by adding a third feature and checking the
AFRR again. The performance of the triplets and quadruplets is shown in Figure 2. Because adding features
did not improve the False Rejection Ratio while increasing the computation time, the Contrast and Laws’
Absolute Mean pair was retained for the improved learning-based experimentation.

Figure 2. Average FRR variation with addition of features


4. IMPROVED LEARNING 

Achieving the desired balance between stability and plasticity attributes in a classifier is critical. A 
classifier requires plasticity for the integration of any new knowledge, but it also requires stability to prevent 
the loss of previous knowledge [18]. 

Support Vector Machines are stable classifiers that exhibit better classification performance than 
many other machine learning methods [19]. However, stable SVMs suffer from a lack of plasticity and are 
prone to the catastrophic forgetting phenomenon. Catastrophic forgetting occurs when a classifier that has
been trained on one task is subsequently trained on another task and, as a result, loses its ability to
perform the first task.

Therefore, to fully benefit from the SVM Classifier performance, an Improved Learning method 
needs to be applied to the SVM, to retain its stability, while implementing plasticity [20]. 

A classifier is eligible for improved learning if it satisfies the following criteria:

a. It should be able to learn additional information from the input data.
b. It should not require access to the original data used to train the existing classifier.
c. It should preserve previously acquired knowledge, i.e., it should not succumb to catastrophic
forgetting [21].


5. IMPLEMENTATION
For training purposes, a holdout-based training approach was used [22]. From the 64 Volcanic Rock images
and 64 Plutonic Rock images in the database, 32 images from each class were randomly selected to form the
training set. Overall, 10000 such training set combinations were attempted, and the training set that gave
the highest % Accuracy was considered. The algorithm is as follows:
a. Compute the percentage accuracy for the Radial Basis Function SVM, the Optimized Radial Basis Function
SVM, and the AdaBoost classifier with an SVM as the base classifier. Identify the misclassified images in each case.
b. Add the correctly classified images to the training set to improve its ‘Learning’.
c. Using the revised, ‘Improved and more intelligent’ training set, classify the remaining images, i.e.,
those misclassified in the earlier iteration.
d. Identify the misclassified images in each case again.
e. Repeat steps a to d if there is a reduction in the number of misclassified images.
f. Else, stop the algorithm because no further improvement is possible and no ‘addition’ to the training set
is made for ‘improved learning’.
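
The loop in steps a to f can be sketched as below for a single classifier. This is a minimal sketch under stated assumptions rather than the implementation used in this work: a one-dimensional nearest-centroid rule stands in for the SVM and AdaBoost classifiers, the function names and the toy samples are hypothetical, and the true labels of the classified images are assumed to be available so that correctly classified images can be identified.

```python
def fit_centroids(samples):
    """Stand-in classifier: per-class mean of 1-D feature values."""
    sums, counts = {}, {}
    for value, label in samples:
        sums[label] = sums.get(label, 0.0) + value
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict_nearest(centroids, value):
    """Assign the label of the nearest class centroid."""
    return min(centroids, key=lambda label: abs(value - centroids[label]))


def iterative_improved_learning(train, test, fit=fit_centroids,
                                predict=predict_nearest):
    """Grow the training set with correctly classified samples until the
    number of misclassified samples stops decreasing.

    Returns (history, misclassified); each history entry is
    (training size, samples classified, samples misclassified).
    """
    train, pending = list(train), list(test)
    history, previous_errors = [], None
    while True:
        model = fit(train)
        correct = [s for s in pending if predict(model, s[0]) == s[1]]
        wrong = [s for s in pending if predict(model, s[0]) != s[1]]
        history.append((len(train), len(pending), len(wrong)))
        no_gain = previous_errors is not None and len(wrong) >= previous_errors
        if not wrong or no_gain:
            return history, wrong
        train += correct       # the 'improved learning' step
        pending = wrong        # retry only the misclassified samples
        previous_errors = len(wrong)


# Hypothetical 1-D samples: (feature value, class label).
toy_train = [(0.0, "V"), (10.0, "P")]
toy_test = [(4.0, "V"), (4.5, "V"), (5.5, "V"), (9.0, "V"), (8.0, "P")]
history, outliers = iterative_improved_learning(toy_train, toy_test)
```

In the toy run, the sample at 5.5 is misclassified at first and recovered once the training set grows, while the sample at 9.0 stays misclassified and is returned as an outlier.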
The flow chart of the implementation is shown in Figure 3.

[Flow chart: Training set of X samples -> classify the remaining N - X samples using algorithm
v = 1, 2, ..., T (1. RBF SVM; 2. Optimized SVM; 3. AdaBoost) -> check the number of misclassified
images -> record the % classification accuracy -> modify the training set, X = N/2 + (N/2 - n_i)]

Figure 3. Flow chart of implementation


6. RESULTS & DISCUSSION

Tables 1, 2, and 3 present the results for the SVM, Optimized SVM, and AdaBoost algorithms,
respectively. The improvement in percentage accuracy for each algorithm is due to
the Improved Learning approach used. Figure 4 summarizes the improvement.
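
The training set sizes and percentages reported in Tables 1 to 3 can be reproduced with the short calculation below. It is offered as a reading aid under two assumptions: accuracy is taken relative to the 64 hold-out images throughout, and n_i in the flow chart expression X = N/2 + (N/2 - n_i) is read as the number of misclassified images, an interpretation that is consistent with the tabulated values. The function names are hypothetical.

```python
N = 128              # total images (64 Volcanic + 64 Plutonic)
HOLDOUT = N // 2     # 64 images classified in the first iteration


def percent_accuracy(misclassified):
    """Accuracy relative to the 64 hold-out images."""
    return 100.0 * (HOLDOUT - misclassified) / HOLDOUT


def training_set_size(misclassified):
    """X = N/2 + (N/2 - n_i), reading n_i as the misclassified count."""
    return N // 2 + (N // 2 - misclassified)


# Misclassified counts that appear in Tables 1 to 3.
for n_i in (8, 7, 5, 4):
    print(n_i, training_set_size(n_i), round(percent_accuracy(n_i), 2))
```

For five misclassified images the exact value is 92.1875 %, which the tables list as 92.18 %.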


Table 1. Algorithm-SVM

Iteration                      1                        2                          3
No. of Training Images         64                       64 + 56 = 120              120 + 1 = 121
No. of Classification Images   64                       8                          7
No. of Misclassified Images    8                        7                          7
Details of Misclassified       P = 8,29,30,34,40,47     P = 8,29,30,40,47          P = 8,29,30,40,47
Images                         V = 13,17                V = 13,17                  V = 13,17
% Accuracy                     87.5 %                   89.06 %                    89.06 %
Comment                        Base accuracy            Improvement through        End because of no further
                                                        learning                   improvement


Table 2. Algorithm-Optimized SVM

Iteration                      1                        2                          3
No. of Training Images         64                       64 + 57 = 121              121 + 2 = 123
No. of Classification Images   64                       7                          5
No. of Misclassified Images    7                        5                          5
Details of Misclassified       P = 8,29,30,34,40,47     P = 29,30,40,47            P = 29,30,40,47
Images                         V = 13                   V = 13                     V = 13
% Accuracy                     89.06 %                  92.18 %                    92.18 %
Comment                        Base accuracy            Improvement through        End because of no further
                                                        learning                   improvement


Table 3. Algorithm-ADABOOST

Iteration                      1                        2                          3
No. of Training Images         64                       64 + 59 = 123              123 + 1 = 124
No. of Classification Images   64                       5                          4
No. of Misclassified Images    5                        4                          4
Details of Misclassified       P = 29,30,40,47          P = 29,30,40               P = 29,30,40
Images                         V = 13                   V = 13                     V = 13
% Accuracy                     92.18 %                  93.75 %                    93.75 %
Comment                        Base accuracy            Improvement through        End because of no further
                                                        learning                   improvement


Figure 4. Improvement in % accuracy with improved learning


7. DISCUSSIONS 

a. An improvement in percentage accuracy is observed with improved learning.

b. The classifier ensemble approach, i.e., the AdaBoost approach with an SVM as the base classifier,
performs the best among the algorithms used for this study.

c. The feature selection was implemented using a Forward Selection algorithm. Contrast and Laws
Absolute Mean are the identified feature pair. Owing to the governing Pressure and Temperature
conditions, Volcanic Rocks have smaller mineral Grain sizes, and Plutonic rocks develop grains of
larger size. The Volcanic and Plutonic Rock constellation on the 2-Dimensional feature plot is shown
in Figure 5.

[Feature plot: Volcanic and Plutonic clusters; axes: Contrast and Laws AM]

Figure 5. Typical data distribution for volcanic and plutonic rocks

Iterative improved learning algorithm for petrographic image classification ... (Ashutosh Marathe)
ISSN: 2088-8708


d. Some images remain misclassified even though the Improved Learning uses all of the algorithms. Some
images that were misclassified by RBF-SVM and Optimized RBF-SVM were correctly classified by the
AdaBoost approach. This confirms its relatively superior performance in terms of classification
accuracy, as demonstrated by Figure 4.
e. The Iterative Improved Learning Algorithm (IILA) is a comprehensive procedure that tests a database
using all algorithms of interest. It also indicates the ‘distinct outliers’, which are not correctly
classified by any of the selected classifiers. The physical parameters governing the generated feature
values can then be specifically investigated.
f. Misclassification is attributed to the overlapping region shown in Figure 5. The images that remained
misclassified even after applying the Improved Learning based AdaBoost algorithm (namely Volcanic No. 13
and Plutonic Nos. 29, 30, 40) demonstrated ‘outlier’ attributes. The outliers exhibit interim ‘Hypabyssal’
group feature attributes [23] and land in the overlapping region. Hence, these findings are consistent
with the Geological properties.

A summary of research approaches used for Petrographic Image Analysis is shown in Table 4. It shows
that the present paper reports a first-of-its-kind use of Support Vector Machine based methods and
classifier combination methods for Petrographic Image Classification. The accuracy reported
is around 94%. The majority of the authors have tried to classify Igneous, Metamorphic, and Sedimentary
rocks, whereas the present paper classifies Igneous rocks into their subfamilies. This
study was carried out on an Intel i3 processor running at 2.10 GHz using MATLAB version R2012b. A
total of 26.2 seconds was required to execute the IILA program. The IILA had 2 iterations, as reported.


Table 4. Summary of Research Approaches used for Petrographic Image Analysis

Sr. No.  Authors                         Approach                            % Accuracy        Comments
1        Baykan, Yilmaz [24]             3-layer feedforward network with    Variable between  Study of total 5 minerals
                                         Minimum Square Error Correction     81 to 98%         belonging to Igneous,
                                         Color Space Based classification                      Metamorphic and
                                                                                               Sedimentary rocks
2        Marmo, Amodiao et. al [25]      Multilayer Perceptron (MLP)         93.3%             Ancient (Phanerozoic)
                                         Network                                               Carbonates from marine
                                                                                               environments were
                                                                                               classified
3        N.Singh, T.N.Singh et. al       Multilayer Perceptron (MLP)         92.22%            Study of Basalt rock
         [26]                            Network
4        Bodziony et.al [27]             Tree Automata and Graph             84.13%            Development of Expert
                                         Automata theory                                       System for Petrography and
                                                                                               Mechanical analysis of
                                                                                               Rocks
5        Ishikawa, Gulick [28]           Training of neural networks based   83%               Spectral mapping of Key
                                         on Raman Spectra                                      minerals characterizing
                                                                                               composition of igneous
                                                                                               rocks
6        A. Marathe et al (in the        i. Support Vector Machine (SVM)     94%               Texture based Locally
         present paper)                  based methods                                         relevant Igneous rock
                                         ii. Classifier Combinations:                          petrographic image
                                         Adaptive Boosting                                     classification


8. CONCLUSION

It can be concluded that the classification of Igneous Rocks into Volcanic and Plutonic subfamilies
can be carried out using a Support Vector Machine based approach and classifier combinations. By fine tuning the
penalty parameter and variance of the Radial Basis Function SVM, optimal performance can be extracted
from the SVM classifier. By using the optimized SVM as the base, i.e., weak, classifier in Adaptive
Boosting, an enhancement in classification accuracy can be achieved. Further improvement in classification
accuracy can be achieved through the iterative Improved Learning approach. The iterative learning also
provides information about the images that remain misclassified despite iteration. This information can be used
for validation of outliers.